cd/entity/Card et al.ยท homeโ€บ entitiesโ€บ Card et al.
grep -l @card et al. /news/*.json | wc -l โ†’ 1

Card et al.

mentions 1 type Person feed RSS

// recent coverage 1 mentions

06:32
2026-06-24
dev.to
machine-learning

Bootstrap confidence intervals for your LLM eval metrics

Nexus Labs' fine-tuning and evaluation team lead demonstrated that a single evaluation metric like 84.2% accuracy on a 500-example set carries significant uncertainty, with a 95% bootstrap confidence โ€ฆ

// co-occurs with top 3 entities